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Abstract 

The parallel dynamics of extremely diluted symmetric Q-Ising neural net- 
works is studied for arbitrary Q using a probabilistic approach. In spite of 
the extremely diluted architecture the feedback correlations arising from 
the symmetry prevent a closed-form solution in contrast with the extremely 
diluted asymmetric model. A recursive scheme is found determining the 
complete time evolution of the order parameters taking into account all 
feedback. It is based upon the evolution of the distribution of the local 
field, as in the fully connected model. As an illustrative example an ex- 
plicit analysis is carried out for the Q = 2 and Q = 3 model. These results 
agree with and extend the partial results existing for Q = 2. For Q > 2 
the analysis is entirely new. Finally, equilibrium fixed-point equations are 
derived and a capacity-gain function diagram is obtained. 
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1 Introduction 

For the parallel dynamics of extremely diluted asymmetric and layered feed- 
forward Q > 2-Ising neural networks recursion relations for the relevant order 
parameters have been obtained in closed form (cfr. [|1]]-[§J and the references cited 
therein). This has been possible because in these types of networks one knows 
that there are no feedback correlations as time progresses. 

For the parallel dynamics of networks with symmetric connections, however, 
things are quite different. Even for extremely diluted versions of these systems 
it is known that feedback correlations become essential from the second time 
step onwards, complicating the dynamics in a nontrivial way. As a consequence 
explicit results concerning the time evolution of the retrieval overlap for these 
symmetrically diluted models have been obtained for the Q = 2 case up to the 
third time step only ||-||. Furthermore, the local instability of neighbouring 
trajectories has been examined, recently, invoking the ansatz that the second 
step formula for the retrieval overlap stays valid for all times it>2 0. So, up to 
now, no systematic analytic procedure is available, even for Q = 2, for calculating 
the complete time evolution taking into account all feedback correlations. The 
main purposes of this paper are to fill this gap and extend the results to general Q. 

The method used is a probabilistic signal-to-noise analysis || but starting 
from the distribution of the local field instead of working directly with the order 
parameters. Recently, a complete solution for the parallel dynamics of fully con- 
nected Q-Ising networks at zero-temperature has been obtained in this way ||. 
Similar to the fully connected architecture, and in contrast with the extremely 
diluted asymmetric and layered network architectures, the local field contains 
both a discrete and a normally distributed part. The difference with the fully 
connected model is that the discrete part at a certain time t does not involve 
the spins at all previous times t — 1, i — 2, ... up to but only the spins at time 
step t — 1. But, again this discrete part prevents a closed-form solution of the 
dynamics. Nevertheless, we succeed in developing a recursive scheme in order to 
calculate the complete time evolution of the order parameters - the retrieval over- 
lap and the activity - taking into account all feedback correlations. In this way 
we have completed our discussion of the parallel dynamics at zero temperature 
for the different architectures - asymmetric and symmetric extremely diluted, 
layered feedforward and fully connected - considered in the literature. 

As an illustration we write out these expressions in detail for the first five 
time steps of the dynamics. For Q = 2 our results agree with the first three time 
steps available in the literature and extend these by a systematic analytic 
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procedure. Furthermore we find that the ansatz put forward in strongly over- 
estimates the retrieval overlap. For Q > 3 our results are new and do give a clear 
picture of the evolution of the network in the retrieval regime. 

Finally, by requiring the local field to become time-independent implying that 
some correlations between its Gaussian and discrete noise parts are neglected we 
can obtain fixed-point equations for the order parameters. For Q = 2 they 
coincide with those derived via thermodynamical methods flQ |. For Q > 3 they 
are not given in the literature before. In this case we obtain for the first time the 
structure of the capacity-gain parameter diagram. 

The rest of the paper is organized as follows. In Section |2] we introduce the 
model, its dynamics and the Hamming distance as a macroscopic measure for the 
retrieval quality. In Section |3| we use the probabilistic approach in order to derive 
a recursive scheme for the evolution of the distribution of the local field, leading 
to recursion relations for the order parameters of the model. The differences 
with other architectures are outlined. Using this general scheme, we explicitly 
calculate in the Appendix the order parameters for the first five time steps of 
the dynamics. In Section f| we discuss the evolution of the system to fixed-point 
attractors. For Q = 2, 3 a detailed discussion of the theoretical results obtained in 
Section [3] and the Appendix is presented in Section ||. Some concluding remarks 
are given in Section |. 



2 The model 

Consider a neural network A consisting of N neurons which can take values 
Oi from a discrete set S = { — 1 = si < s 2 < . . . < sq = +1}. The p patterns 
to be stored in this network are supposed to be a collection of independent and 
identically distributed random variables (i.i.d.r.v.), {£f G S}, n G V = {1, . . . ,p} 
and i 6 A, with zero mean, E[^] = 0, and variance A = Var[£f]. The latter 
is a measure for the activity of the patterns. Given the configuration cr^(t) = 
{<Jj(t)}, j G A = {1, . . . , A^}, the local field in neuron i equals 

hi(<T A \ {i} (t)) = ]T JijVjit) (1) 
jeA\{i} 

with Jij the synaptic couplings between neurons i and j. In the sequel we write 
the shorthand notation h^i(t) = hi(cr\\[iy(t)). 

The network is taken to be extremely diluted but symmetric meaning that the 
couplings are chosen as follows. Let {c^- = 0, G A be i.i.d.r.v. with distri- 
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bution Pr{cij = x} — (1 — C/N)8 X $ + (C/N)8 x> i and satisfying Cjj = Cji, Cu = 0, 
then 

'h -TTaYI f ^ ' / ./ • (2) 

Compared with the asymmetrically diluted model jTTJ] the architecture is still a 
local Cayley-tree but no longer directed and in the limit N — ■> oo the probability 
that the number of connections Tj = {j G A | = 1} giving information to the site 
i G A is still a Poisson distribution with mean C = E[\Ti\}. Thereby it is assumed 
that C <C log AT and in order to get an infinite average connectivity allowing to 
store infinitely many patterns p one also takes the limit C —>■ oo and defines the 
capacity a by p = aC. However, although for the asymmetric architecture, at 
any given time step t all spins are uncorrelated and hence no feedback is present; 
for the symmetric architecture this is no longer the case, causing a feedback from 
t > 2 onwards |J. This feedback complicates the dynamics. 

The following dynamics is considered. The configuration (T\(t = 0) is chosen 
as input. At zero temperature all neurons are updated in parallel according to 
the rule 

<?i(t) -> + 1) = s k : mm ei[s\(T A \ {i} (t)] = ei[s k \cr A \ {i} (t)} . (3) 
Here the energy potential ej[s|cr A \{j}] is defined by 

e t [s | <r A \ {l} ] = - - [hi (cr A \ {i} ) s - bs 2 } , (4) 

where b > is the gain parameter of the system. The updating rule ([3]) is 
equivalent to using a gain function g 6 (-), 

<7i(* + l) = g b {h Ali {t)) 
Q 

Sb( x ) = ^2 s k[0 [b{s k +i + s k )-x]-8 [b(s k + s fc _i) - x]] (5) 
fc=i 

with so = — oo and sq + i = +oo. For finite Q, this gain function g 6 (-) is a step 
function. The gain parameter b controls the average slope of g&(-). 

To measure the retrieval quality of the system one can use the Hamming 
distance between a stored pattern and the microscopic state of the network 



(6) 
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This naturally introduces the main overlap 

^(*) = 4rEer^(*) (7) 

and the arithmetic mean of the neuron activities 

°a(*) = ^ EM*)] 2 • ( 8 ) 

We recall that in the thermodynamic limit C, A" — > oo all averages will have to 
be taken over the treelike structure, viz. XlieA ~~ *■ ^eiy We remark that for 
Q = 2 the variance of the patterns A = 1, and the neuron activity a^(t) = 1. 



3 Recursive dynamical scheme 

As mentioned above in an extremely diluted network the symmetric couplings 
cause non-trivial correlations, even at zero temperature, which are known to 
become increasingly tedious to evaluate ||-||. So, results on the dynamics for 
these symmetric systems existing up to now concern Q = 2 only and are restricted 
to the first three time steps. 

Using a probabilistic approach (see, e.g.,H,[B|) we calculate the distribution 
of the local field for a general time step for Q > 2 systems analogously to the 
fully connected case studied very recently @. This allows us to obtain recursion 
relations determining the full time evolution of the relevant order parameters. 

Suppose that the initial configuration of the network {cjj(O)}, % G A, is a collec- 
tion of i.i.d.r.v. with mean E[<7j(0)] = 0, variance Var[erj(0)] = ao, and correlated 
with only one stored pattern, say the first one {^} : 

E[^(0)] = SiJ^mlA ml>0. (9) 

This implies that by the law of large numbers (LLN) one gets for the main overlap 
and the activity at t — 

m l (0) = lim m\(0) ^ ~EfcV 4 (0)] = m l (10) 

C,N—+oc J\ 

o(0) = lim o A (0) = E[t7?(0)] = o (11) 
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where the convergence is in probability ||12|| . Writing the local field at t = as 



hi(0) 



lim 

C,N-+oc 



^(0) + 



CA 



»eV\{l}j£Ti 



where 



'121 



(13) 



and where we recall that Tj is the part of the tree connected to neuron j, we find 
using standard signal-to-noise techniques (see, e.g., |6|1 , [|TT| ) 

hi(0) = gm 1 (0)+M(0,aa o ), (14) 

where the convergence is in distribution jJEJ. The quantity A^(0, V) represents a 
Gaussian random variable with mean and variance V. At this point we note 
that this structure of the distribution of the local field at time zero - signal plus 
Gaussian noise - is typical for all architectures treated in the literature. 

The key question is then how these quantities evolve in time under the parallel 
dynamics specified before. For a general time step we find from eq. (|5|) and the 
LLN in the limit C, N — > oo for the main overlap ([?]) and the activity (j8|) 



m 1 (t + 1) 
a(t + l) 



Pr 



Pr 



((dm)))) 



(15) 
(16) 



with hi{t) = lim^Ar^oo h^i{t). In the above ((■)) denotes the average both over the 
distribution of the embedded patterns {£f } and the initial configurations {o"j(0)}. 
The average over the initial configurations is hidden in an average over the local 
field through the updating rule (|5|). 

From the work on fully connected networks we know that due to the 
correlations we have to study carefully the influence of the non-condensed patterns 
in the time evolution of the system, expressed by the variance of the residual 
overlaps. The latter is defined as 



r M (t) = lim 



Ti 



lim — 

C,N^oo j\ 



(17) 



The aim of this section is then to calculate the distribution of the local field 
and the order parameters as a function of time. This can be done for arbitrary 
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Q without making the notation much heavier. The simplifications for Q = 2 will 
be mentioned when appropriate. 

We start by rewriting the local field ([[]) at time t in the following way 



h Kl {t) = iimUt) + 



7^ E £f4(t). 



We first concentrate on the residual overlap r^.(t), fi G V\{1}. Since the neurons 
{(Jj(t)\j G Tj} are not i.i.d.r.v. the central limit theorem (CLT) can not be applied 
directly to this residual overlap. Therefore we follow a procedure similar to that 
used for the fully connected model || by taking out of the local field precisely 
the contributions arising from these dependences. In order to do so we apply the 
dynamics writing the residual overlap (O) as 



(19) 



In analogy with the fully connected case we expect contributions coming from the 
fact that the a jit + 1) are dependent on the <Ji(t) for all j G T; and from the fact 
that the <Jj(t+ 1) and £j* are dependent (the latter dependence is microscopic but 
leads, at least for a fully connected architecture, to a macroscopic contribution 
in the thermodynamic limit). Therefore, we define a modified local field 



1 



V<- uep\{i,n,} 



E mxt) 



(20) 



Expanding g 6 (-) in 

rfe(*+l) 
1 



) around h^j(t) we arrive at 
1 

i 



-jje E E [i^i^.WI - 16(-* + W 



sign(6(s fc + Sfc+i) - hl tj (t)) + sign(-j=£fr£. 



(*)) 



(21) 



with the Heaviside function. At this point we remark that instead of using 
explicitly the set of indices where the term -^^,j r Tj (t) becomes relevant in the 
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gain function g 6 (-) (see [|] for more details), we replace it, more conveniently, by 
the sum over j of the Heaviside function. 

We then consider the limit C, N —>■ oo. First we remark that in this limit the 
density distribution of the modified local field hf (t) at time t equals the density 
distribution of the local field hi(t) itself. Hence, applying the CLT to the first 
term of (^l|) gives, recalling eqs. (|16|) and (p0|) 

r»{t) = o Jm o £ ^M/t)) S A/"(0, a(f + 1)/A) (22) 

because of the weak dependence of hj(t) and To the second term of (pip we 
apply the LLN arriving at a contribution given by the average of 

^4 (() (^w + s k ))^\\r^{t)f + X (tm\ 2 r£fi) (23) 

over {C,j}, {cfj(O)} and where 

Q-i 

= £ - s k)f^(t)( b ( s k+i + s fc )) . (24) 
fe=i J 

Here is the probability density of the modified local field hf(t) at time t and 
/^M( t ) denotes its derivative w.r.t. the argument. Since the residual overlap r^.{t) 
depends explicitly on j, it is also averaged over, while in the fully connected case 
the residual overlap r£(t) is kept fixed. Doing these averages none of these terms 
contribute to r M (t + 1) as expressed by (pip . 

This implies that r M (t+l) is Gaussian with variance a(t+l)/A, in contrast with 
both the fully connected architecture [[J and the layered architecture f|], where 
the variance contains extra terms. This finishes the treatment of the residual 
overlap. We remark that for Q = 2 the following simplifications are possible: 
b(s k+1 + s k ) = 0, (sjfe+i - s k ) = 2, g 6 (-) = sign(-) and a(t) = A = 1. 

However, subtleties arise in the treatment of the local field at time t + 1. We 
have to take into account the correlations between £f and r j, . Starting from the 
form ( |18D and using eqs. fl2~l|) -([2"3"D we find that the second part of (p3|) leads to a 
contribution from the site i in the local field hi(t + 1) given by ax{t)<Ji{t) . This is 
different from the asymmetrically diluted case, where the probability to find the 
site i in is zero such that we do not get any contribution at all. Furthermore, 
for the fully connected case, where all sites of the residual overlap in the second 
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From this it is clear that the local field at time t consists out of a signal 
term, a discrete noise part and a normally distributed noise part. The variance of 
the local field is aA times the variance of the residual overlap. Furthermore, the 
discrete noise and the normally distributed noise are correlated and this prohibits 
us to derive a closed expression for the overlap and activity (eqs. (p!5|)-(p^)). 

These correlations, which become more complicated as time evolves, will de- 
termine the distribution of the local field, fhi{t) m eq. (0). Using the evolution 
equation crj(i) can be replaced by gb(hi(t — 1)) such that the discrete noise part is 
a stepfunction of correlated variables. These are correlated through the dynam- 
ics with the normally distributed part of hi(t). Therefore the local field can be 
considered as a transformation of a set of correlated normally distributed vari- 
ables x s , s = t — 2[£/2], . . . , t — 4, t — 2, t, which we choose to normalize. The 
brackets [t/2] denote the integer part of t/2. Defining the correlation matrix 
w = (p(s, s')) = (E[x s x s ']) we arrive at the following expression for the probabil- 
ity density of the local field at time t 



With X= {x P -2[t/2],...X tr -2,X t ). 

In conclusion, eqs. (|H)|) , (|T6"|) together with eqs. (|23J)-(^(|) form a recursive 
scheme in order to obtain the order parameters of the system. 

As an illustration we write down in the Appendix the evolution equations for 
the order parameters of a general Q > 2-Ising network for the first five time steps, 
taking into account all correlations. Five time steps suffice to give an accurate 
picture of the dynamics in the retrieval regime of the network as we will see in 
Section ^. In the literature only the first three time steps of the Q = 2 model 
are known. Our results completely solve the parallel dynamics for any Q > 2. 
This also allows us to determine precisely the effects of neglecting the correlations 
between the Gaussian and discrete part of the noise, i.e., an overestimate of the 
main overlap. 




s=0 



X 




(26) 
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4 Fixed-point equations 

A second type of results can be obtained by requiring through the recur- 
sion relations (|25[) that the local field becomes time- independent. This means 
that some of the discrete noise part is neglected. We show that for Q = 2 this 
procedure leads to the same fixed-point equations as those found from a ther- 



modynamic replica symmetric mean-field theory approach in ||10|| . For Q > 2, 
however, these fixed-point equations are new since no replica results are available 
in the literature. 

In the extremely diluted and layered Q-Ising models the evolution equations 
for the order parameters do not change their form as time progresses, such that 
the fixed-point equations are obtained immediately by leaving out the time de- 
pendence (see |0, Q). This still allows small fluctuations in the configurations 
{<j;}. Similar to the fully connected model the form of the evolution equations 
for the order parameters in the symmetrically diluted model treated here does 
change by the explicit appearance of the {cj(£)} term, such that we can not use 
that procedure to obtain the fixed-point equations. Instead we require that the 
distribution of the local field given by ( f25|) becomes independent of time. This is 
an approximation because fluctuations in the network configuration are no longer 
allowed. It implies that the main overlap and activity in the fixed-point are found 
from the definitions (0), (||) and not from leaving out the time dependence in the 
recursions relation flloj ) and (0). 

So, eliminating the time-dependence in the evolution equations for the local 
field ( ^5|) one obtains 

fn = gm 1 + Af(0, aa) + OL\Oi (27) 

with hi = lim^oo hi(t). Employing this expression in the updating rule (|) one 
finds 

Oi = g b (hi + axoi) (28) 

where hi = M{^}m l ,aa) is the normally distributed part of eq. (|27|). Compared 
with the fully connected model (see || eqs. (79) and (80)) one sees that the 
equations here have a similar structure but simpler coefficients in front of the 
noise terms, viz. the denominator 1 — % is replaced by 1. Therefore the same 
method of solution as in the fully connected CcLSG, 1.6. , cl geometrical Maxwell 
construction (see also iT3|,|14||) can be employed. 



To make the discussion self-contained we shortly recall this method. Let L be 
the straight line which connects the centers of the plateaus of the gain function 
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g&(-). The equations for the functions g 6 (-) and L(-) read 

g b : x i-> s fc if 6(s fe + s fc _i) < x < 6(s fc + s fc+ i) (29) 
L : (30) 

The condition on the r.h.s. of fl29|) is a condition on x. Using the definition of 
L(-), one can transform this into a condition on the image of L(-), II = {y G 
IR | 3 x e JR : L(x) = y}, viz. 

g(> (x) = Sl if^=i<iW<^±i. (31) 
Consider the transformation T : (x, y) \— > {x — 2/) 

— 2(6^)' (32) 

The function T(g 6 )(-) is not bijective while T is not one-to-one. To obtain a 
unique solution for eq. @8|) we modify the former function such that it becomes 
a step function with the same step height as the one in T(g b )(-) and the width of 
the steps such that T(L) connects the centers of the plateaus: 

T L (g b )(x) = s k if (b - ^)0 fe + sjfc_i) < x < (b - ^-)(s k + s k+1 ) (33) 

or, using (|) 

T L (g 6 )(x) = g b (x) with b = b-^. (34) 

This at first sight ad-hoc modification leads us to a unique solution of the self- 
consistent equation (P8|). Indeed, from this modified transformation we know 
that 

g b (h + axo) ^ gs(A + a\cr - axg b (h + ol%o)) , (35) 

such that 

<*i = &bC h i) ■ (36) 

Using the definition of the main overlap and activity (0) and in the limit 
C,N — > oo, one finds in the fixed point 



m 1 



i ^ / Z>* gg (^m 1 + )) (37) 

J Dsgg^W + v^))) , (38) 
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where we recall that 



= 0-—, x 



Vz zg- b (^m 1 



aaz 



(39) 



For the special case of Q = 2 the resulting equations fl3"7|)-(P9"D are the same as 
those derived from a thermodynamic replica-symmetric mean-field theory treat- 
ment in fl(|. For general Q > 2 such a comparison can not be made because 
a thermodynamic treatment is not yet available in the literature. Hence it is 
interesting to write down these equations in more detail here 



m 



X 
b -- 



si 



2A 

4 + 4 

2 
1 



V2 



iraa 



b-^ 



2A 



J2( s k+i ~ s k) ((^erf 



- b(s k+1 + s k ) 



k=l 
-1 



aa 



S k+1 S k) 



erf 



£W - b(s k+1 + s k ) 



k=i 



aa 



\k=l 



Sk) exp 



- (fW - b(s k+1 + 
2aa 



»(40) 
(41) 

(42) 
(43) 



We are presently working out such a thermodynamic approach for these systems 
at arbitrary temperature [113 . 



5 Numerical results 

The equations derived in Section [3] and the Appendix have been studied nu- 
merically for the Q = 2, 3 model with equidistant states and a uniform distribu- 
tion of patterns, implying that A = 1 for Q = 2 and A = 2/3 for Q = 3. 

In the Q = 2 case the temperature-capacity phase diagram given by a ther- 
modynamic replica-symmetric mean-field theory approach has been presented in 
||10|| . From that work we know that the critical capacity at zero temperature 
equals 0.634. So we discuss the parallel dynamics using the complete recursive 
scheme developed here for a typical point in the retrieval regime, e.g., a = 0.3. 
In Fig. 1 we show the overlap m 1 (t), t = 1 to 5 versus the initial overlap with 
the condensed pattern (thick lines). (We forget about the superscript 1). For 
m > 0.4 we see that the retrieval attractor is reached quickly. In fact four or five 
time steps give us already an accurate picture of the dynamics in the retrieval 
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Figure 1: The overlap m(t) for Q = 2 systems is presented for the first five time 
steps as a function of m for a = 0.3. The results for the first, second, third, 
fourth and fifth time step are indicated by a dotted, a short-dashed, a dashed- 
dotted, a long-dashed and a full curve respectively. The results using the ansatz 
of ref. [7] for the third, fourth and fifth time step are indicated with thin lines. 

region. At this point we note that the convergence to the attractor is of an oscil- 
lating nature, in contrast with the asymmetrically diluted model. The first three 
time steps were given in the literature before (||-|J) and they agree completely 
with our results. 

In this figure we also indicate the overlap (thin lines) when making the ansatz 
that the structure of the formula for the second time step is valid for any time 
step. This ansatz neglects some correlations and it systematically overestimates 
the overlap for all m . This effect becomes even stronger when going from time 
step 3 to 5. 

Concerning the Q = 3 model, in order to determine the retrieval regime of 
the network, we first have to solve the fixed-point equations (pE0"|)-(pE3"D derived in 
Section |] since a thermodynamic phase diagram is not available in the literature. 
The resulting capacity-gain diagram is presented in Fig. 2. We discover three 
different regions in the retrieval regime. In region I (bounded by the straight 
full line and the dashed-dotted line) the activity a is of order 1, and b < 0. 
Consequently we call it the Ising-like region. In region II and III, a is of order 
A (and b > 0). The difference between II and III, separated by a short-dashed 
line, is that there is no sustained activity state (m = fl,a ^ 0) present in the 
latter. The zero solution is always a fixed-point. We remark that the boundary 
of the retrieval region is denoted by a full line when the transition (to the spin- 
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b 

Figure 2: The a — b phase diagram for Q = 3. The full curve and long-dashed 
curve indicate the boundary of the retrieval region. The dashed-dotted curve 
denotes the boundary between the Ising-like region I and the other regions. The 
short-dashed curve represents the boundary between regions II and III. The points 
1 to 4 indicate the network parameters used in the discussion of the dynamics. 




glass phase) is continuous and by a broken line when it is first-order. For further 
details on this diagram, which are not relevant to our present discussion we refer 
to (15[. 



For specific network parameters corresponding to some arbitrarily chosen 
points in the retrieval phase in this equilibrium phase diagram, indicated as 1 
to 4, we have studied the dynamics governed by the evolution equations found 
here. Figures 3-5 present an overview of these results by plotting the overlap 
m(t), the activity a(t) and the Hamming distance d(t) versus the initial overlap 
m with the condensed pattern. The initial activity is taken to be = 0.83. 

In the Ising-like region I, the network parameters corresponding to point 1 are 
a = 0.5,6 = 0.1. We see in Fig. 3 that the Hamming distance, d(t), reaches its 
plateau value indicating retrieval for mo > 0.6. The corresponding value of the 
main overlap is about 0.83 and the activity goes to values larger than A = 2/3. 
So, the network configuration is no longer uniformly distributed: the state cij = 
has a smaller probability to appear than the states Oi = ±1. Hence, the Hamming 
distance can never be that small. 

A somewhat similar type of behavior is found in the fully connected model 
in the region where the order parameter giving the mean-square random overlap 
with the non-condensed patterns is of order 10 |§. 

For a network corresponding to point 2 in region II of the phase diagram 
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Figure 3: The overlap m(t), the activity a(t) and the Hamming distance d(t) 
are presented for the first five time steps as a function of m for the network 
parameters b = 0.1, a = 0.5, a = 0.83. The curves for the different time steps 
are as in Fig. 1 
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Figure 4: As in Fig. 3, for the network parameters b = 0.5, a = 0.3, a = 0.83. 



with a = 0.3, b = 0.5 we see in Fig. 4 that for m > 0.5 the dynamics evolves 
to the retrieval attractor with main overlap m = 0.84. For these values of m 
the Hamming distance quickly goes to its plateau value (smaller than the one 
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Figure 5: As in Fig. 3, for the network parameters b = 0.6, a = 0.1, a = 0.83. 

of point 1). The activity then attains a value near A meaning that the network 
configuration is uniformly distributed. We remark that the boundary between 
the retrieval attractor and the zero attractor becomes clearly visible, especially 
in the behavior of m(t) and a(t). In this region we have also considered the point 
labeled as 3, i.e., a = 0.6,6 = 0.5. There the behavior of the dynamics is very 
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similar to point 1, and hence we have not drawn a corresponding figure, except 
for the fact that the activity a goes to a value near A (as for point 2) indicating 
again a uniformly distributed network configuration. 

In region III of the phase diagram for the network parameters corresponding 
to point 4 with a = 0.1, b = 0.6, we need a greater value of m to reach the 
retrieval attractor. As seen in Fig. 5, m has to be at least 0.65. But the value 
of the main overlap is larger, m = 0.93, and the Hamming distance is smaller 
than for the other network parameters. Furthermore, the boundary between the 
retrieval attractor and the zero-attractor is sharply determined. 

6 Concluding remarks 

An evolution equation is derived for the distribution of the local field governing 
the parallel dynamics at zero temperature of extremely diluted symmetric Q > 
2-Ising networks. For the first time, all feedback correlations are taken into 
account. In contrast with extremely diluted asymmetric and layered networks 
and in analogy with fully connected models this distribution is no longer normally 
distributed but contains a discrete part. 

Employing this evolution equation a general recursive scheme is developed 
allowing one to calculate the relevant order parameters of the system, i.e., the 
main overlap and the activity for any time step. This scheme has been worked 
out explicitly for the first five time steps of the dynamics. 

Under the condition that the local field becomes time-independent, meaning 
also that some of the discrete noise part is neglected, fixed-point equations are 
obtained for these order parameters. For Q > 2 these equations were not given in 
the literature before. Hence, we have shortly discussed the capacity-gain diagram 
for the Q = 3 model. 

As an illustration a detailed discussion of the dynamics is given for the Q = 2 
and Q = 3 model. It is seen from these numerical results that the first four or five 
time steps do give already a clear picture of the time evolution in the retrieval 
regime of the network. For Q = 2 the ansatz that the structure of the formula for 
the second time step is valid for all times t > 2, neglecting any further correlations 
between the Gaussian and discrete part of the noise, strongly overestimates the 
retrieval overlap. 
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Appendix: Evolution equations for the first few 
time steps 

Al: First step dynamics 

The order parameters for the first time step can be written down immediately 



using eqs. (]T2| ) , (]T5|) and (jig) 



m\l) = Jvzg b {em\Q) + ^[^)z)fi (44) 

a(l) = l^j Vz gj^m^O) + v^(0) z)J) , (45) 

where ((■ • •)) now indicates the average taken with respect to the distribution of 
the first pattern and the initial configuration and T>z denotes a Gaussian measure 
Vz = rfzexp(-i2 2 )/ v / 27T. 



A2: Second step dynamics 

First we need the distribution of the local field at time t = 1 which can be 
read off from eqs. ( p6] ) 

hi(l) = £ m\l) + ax(0)<r<(0) + jV(o, aa(l)) . (46) 



with (see eq. O) 

X(0) = (( y=y / 132 2 B^MO) + V^(0) *))) • ( 4? ) 
Recalling again eqs. ([H|) and ([IB]), the main overlap and the activity read 

m\2) = l^ 1 |^g,^W(l) + a X (0)a(0) + V / ^W^^ (48) 

a(2) = ^|p^g^eW(0) + ax(0M0) + ^(l)^))) (49) 

A3: Third step dynamics 

We start again by writing down the distribution of the local field at time 
t = 2. 

h t (2) = £W(2) + a X (l)<Ti(l) + Af(0, aa{2)) (50) 
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where (eqs. 



Jaa(l) W J v 



(51) 



This gives for the main overlap 



ml (3) = J ( ^(2) + a X {l)a{\) + ^ aa{2) y 



(52) 



with y the Gaussian random variable W(0, 1). The average has to be taken over 
y, o"j(0) and <7i(l). The average over £ x and <7j(0) causes no difficulties because 
this initial configuration is chosen randomly. The average over y, the Gaussian 
random variable appearing in hi(2), and 0"j(l) is more tricky because hi(2) and 
crj(l) are correlated by the dynamics. However, the evolution equation @ tells 
us that Cj(l) can be replaced by g 6 (/tj(0)) and, hence, the average can be taken 
over /ij(0) instead of <7j(l). 

From the recursion relation (ESJ) one finds for the correlation coefficient be- 
tween hi(0) and hi{2) 



P(2,0) 



(7(0) |Dzg 6 (eW^+axW^W + ^Hl)^ 



'o(0)o(2) 

Using all this the main overlap at the third time step (B^) becomes 
m x (3) = i^ 1 /^ 2 '°(^2/)&(^ 1 (2)+ 



(53) 



5 6 (^ 1 m 1 (0) + A /^(0)ar) +^aa(2)y 



where the joint distribution of x and y equals 



Vw ZM (x,y) 



dx dy 



2nJl-p{2,0y 



cxp 



x 2 - 2p(2,0)xy + y 2 
2(l-p(2,0) 2 ) 



(54) 



(55) 



In an analogous way one arrives at the expression for the activity at the third 
time step 



o(3) 



gbC^m^O) + y/aa(0)x)] + ^/aa(2)j/ 



(56) 
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From these results, in particular the absence of the <7j(0)-term in (|50| ) we see 
that feedback loops over two time steps exist, but the probability to have loops 
over a longer time period equals zero. If the dilution is asymmetric 0, even all 
feedback disappears and the local field is simply Gaussian distributed. For the 
special case of Q = 2 these results agree with the results available for the first 
three time steps in the literature 0-0. 

A4: Fourth step dynamics 

Again, from eqs. (|24|)-(|2"6"|) we find 

hi(3) =e> 1 (3) + ax(2)(7i(2)+jV(0,aa(3)). (57) 



with 



X (2) = 11 - 1 [Vzz f Vyg b (em{2) 

\\ v /aa(2)(l-p(2,0) 2 )^ J \ 



+ ax{l)9b(^m(0) + y/aa(0)y) 



+ yjaa(2)(l - p(2, 0) 2 ) z + yjaa(2) p(2, 0)y j J J . (58) 

This leads to the main overlap for the fourth time step 

m\A) = i ^fg 6 (^(3) + ax{2)*(2) + yU(3)^ (59) 

with z the Gaussian random variables jV(0, 1). The average has to be taken over 
z, CTj(O), and <Tj(2) or recalling the evolution equation (|5|) over ^,z, crj(O) and 
/ij(l). The distribution function of these variables, i.e., Dw 3,1 (x,y) is given by 
eq. fl55|) with the index 2 replaced by 3 and by 1. The correlation coefficients be- 
tween the fields at different time steps can again be calculated from the recursion 



relation (25) 



p(3,l) = // / Vw 2 '°(x,y)g b feW(O) + ^M0)x 

xg 6 W(2) + a X (l)g b (Vm^O) + yW(0)x) + y^(%) 



(60) 
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Using all this eq. fl59|) becomes 

ml(4) = / p ™ 3,1 (^)& (V™ 1 ^) 

+ a X (2)g b UWil) + ax(0)<7(0) + y/aa(l)x" 
+ V / H3)2/)}) ■ (61) 
In an analogous way the activity at the fourth time step can be calculated 

+ a X (2)g b + ax(0)<r(0) + y/cta(l)x) 

^(3)y))) • (62) 
Finally we present the calculations for the fifth time step. 

A5: Fifth step dynamics 

The local field hi(4) reads 

hi(4) =eV(4)+ax(3)a i (3)+AT(0,«a(4)). (63) 

The expression for x(3) has an analogous form as the one for x(2) since the 
structure of hi(3), needed to calculate it, is similar as the one of hi(2) 



X(3) 



= (( i 1 [Vzz [Vyg b (Vm(3) 

\\^/aa(3)(l-p(3,l)V J V 

+ a X (2)^(^m(l) + ax(0)a(0) + ^(1) y) 



+ v /aa(3)(l - p(3, l) 2 ) z + yjaa(3) p(3, l)y J J J . (64) 
The overlap at time step five is then given by 

m\5) = j (^W(4) +a X (3)a(3) + y/^j (65) 
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with z the Gaussian random variables jV(0, 1). The average has to be taken over 
z, (Ti(0), and <7j(3). Rewriting the network configuration {o"j(3)} by means 
of the gain function (|^) the local fields at time steps 0, 2 and 4 appear. The 
distribution function of these three local fields equals 

4,2,0/™ ,. \ _ dx dy dz I 1 / \ . A2,o\-i 



Vw ' ' (x, v, z) = : exp — x y z (w 

(27r) 3 /VDetw 4 < 2 >° 1 1 1 

' '(66) 
where 

/l p(2,0) p(4,0)\ 

p(2,0) 1 p(4,2) (67) 
Vp(4,0) p(4,2) l] 

with the correlation coefficients p(2, 0), p(4, 0) and p(4, 2). 

The correlation coefficients between the fields at different time steps not men- 
tioned before can again be calculated using the relation (|2~5|). They read 



w 4,2,0 



p(4,0) = / J (a(0) [Vw s '\x,y) 
\M0)a(4) W J 

xg b UW(3) + a X (2)g b (^(1) + «x(0)<t(0) + ^/aa(l)a; 



+Vcm(% (68) 



and 



p(4,2) = —^ = //fvw 3 '\x,y)g b (em\l) + ax(0)a(0) + ^^)x" 
yja(2)a(i) W \ v > 

xg fe ^W(3) + a X (2)g fe + a X (0)a(0) + ^Ml)x^ 

+ V^(3)y))) • (69) 
Using all this eq. fl65"D becomes 

ml(5) = -^(^7 Pw4 ' 2 ' 0(x ' ?/ ' 2)gfe ( elml(4) 



+ "X(3)g fe KW(2) + ax(l)g6 (^(O) + yjaa(0)x) + sjaa{2)y 



+ Jaa(i)z . (70) 
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In an analogous way the activity at the fifth time step can be calculated 
a\5) = jT>w^\x,y,z)gl(ern\A) 

+ «x(3)g b (Vm 1 ^) + a X {l)g b (f W(0) + ^/ao(O)ar) + y/aa(2)y^ 

+ ^MA) Z yjj. (71) 

Since the numerical results explicitly show that five time steps do give already 
an accurate picture of the dynamics in the retrieval regime of the network we do 
not write out further steps. 



A6: Remarks on Q=2 



We recall that for Q = 2 the following simplifications are possible: b(sk+i + 
s k) — 0, (sk+i — Sfc) = 2, g 6 (-) = sign(-) and a(t) = A = 1. Since the first three 
time steps for this special case are derived in the literature @-|J we give here, 
as an illustration, the formula for the overlap for t = 4 

l + o-m^O) 



m '(4) = E fv^\z,y) 



<T=±1 



sign (m 1 (3) + «x(2) sign(m 1 (l) + cr«x(0) + \fax) + \faty) -(72) 

In comparison with the fully connected model we remark that only two correlated 
Gaussians are present here and the term in x(l) is absent. Furthermore, the 
ansatz used in for the study of neighbouring trajectories amounts to neglecting 
the correlations between the Gaussian and discrete part of the noise. The result 
of this approximation is to take all correlation coefficients p(t, t — 2) = (such 
that, e.g., the matrix w 3,1 in the formula ([72]) above becomes the unit matrix). 



